Lightweight Document Matching for Help-Desk Applications
Abstract
An early goal for AI systems was to provide a response when a user typed a problem description into a computer. With help desks, this classic problem emerges in modern form. One example is when a user enters a complete problem description and expects a program to provide help, often in the form of relevant documents from a database. The information-retrieval community has extensively addressed such problems. Most solutions involve text-search-based systems that accept as input a query or limited-length textual phrase, and produce as output a list of potentially relevant documents [1]. Unfortunately, many such systems require substantial storage and computing resources. Also, the document representations and document-matching algorithms they employ can be complicated. Search engines are one embodiment of these systems. Typically, they have an index of most of their stored documents' words, to which they match words from a query. A search algorithm that tries to match many input words exactly is unlikely to find any document that matches all of them. In contrast, a document matcher accepts an entire, new document as input, so the query can have hundreds of words. In this article, we describe a completely automated Java-based document matcher that accepts an unlimited-length textual structure as input and employs a fast matching algorithm to produce, like a search engine, a ranked list of relevant documents. Our approach requires minimal processing and storage, and is therefore suitable for installation in restricted environments, such as Java-compatible mobile or small desktop computers (the document matcher can run on a large server as well, but the approach we take here is effective even when resources are relatively scarce). Empirical results show that despite its lightweight algorithms, the method effectively fulfills its predictive-performance goals.
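The abstract does not spell out the matching algorithm, so the following is only a minimal sketch of the general idea it contrasts with exact-match search: take a whole document as the query, look up its words in an index of stored documents, and rank the stored documents by how many distinct words they share with the input. The scoring rule (plain word overlap), the tokenizer, and the names `DocumentMatcherSketch`, `add`, and `match` are assumptions made for illustration, not the article's method.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Illustrative sketch only: a word-overlap matcher (an assumption),
// not the algorithm used in the article.
final class DocumentMatcherSketch {
    // Inverted index: word -> ids of the stored documents that contain it.
    private final Map<String, Set<Integer>> index = new HashMap<>();
    private int nextId = 0;

    // Lowercases the text and returns its distinct alphanumeric words.
    static Set<String> words(String text) {
        Set<String> result = new HashSet<>();
        for (String w : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!w.isEmpty()) {
                result.add(w);
            }
        }
        return result;
    }

    // Stores a document in the index and returns its id (0, 1, 2, ...).
    int add(String document) {
        int id = nextId++;
        for (String w : words(document)) {
            index.computeIfAbsent(w, k -> new HashSet<>()).add(id);
        }
        return id;
    }

    // Returns the ids of stored documents sharing at least one word with the
    // input, ordered by number of distinct shared words (ties: lower id first).
    List<Integer> match(String inputDocument) {
        Map<Integer, Integer> scores = new HashMap<>();
        for (String w : words(inputDocument)) {
            for (int id : index.getOrDefault(w, Collections.<Integer>emptySet())) {
                scores.merge(id, 1, Integer::sum);
            }
        }
        List<Integer> ranked = new ArrayList<>(scores.keySet());
        ranked.sort((a, b) -> {
            int byScore = Integer.compare(scores.get(b), scores.get(a));
            return byScore != 0 ? byScore : Integer.compare(a, b);
        });
        return ranked;
    }
}
```

Because a stored document only needs to share some words with the input, a query of hundreds of words still yields a ranked list instead of failing to find an exact match. A practical matcher would likely also weight or filter words (for example, ignoring very common ones), which this sketch leaves out.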
Similar resources
Automated generation of model cases for help-desk applications
Document databases may be ill-formed, containing redundant and poorly organized documents. For example, a database of customers' descriptions of problems with products and the vendor's descriptions of their resolution may contain many descriptions of the same problem. A highly desirable goal is to transform the database into a concise set of summarized reports (model cases), which in turn are more ...
The Reading Desk: Supporting Lightweight Note-Taking in Digital Documents
When reading on paper, readers often write notes, fold corners or insert bookmarks without apparent conscious effort. Research into digital reading has discovered that electronic tools are far less intuitive, require significantly more attention, and are much less used. This paper introduces “The Digital Reading Desk” – a document reading interface that enhances existing digital reading interac...
At&t Help D
This paper introduces a new breed of natural language dialog applications which we refer to as the Help Desk. These voice-enabled applications are an evolution from Help Desk services that are currently available on the web or being supported by human agents. The goals of a voice-enabled Help Desk are to route calls to appropriate agents or departments, provide a wealth of information about vari...
Knowledge management-centric help desk: specification and performance evaluation
The technology help desk function has grown in importance as information technology has proliferated throughout the organization. The primary objective of the help desk is to resolve problems related to IT in the organization. As such, the agents in the help desk must be very knowledgeable of the information systems, applications, and technologies supported. Most efforts at improving help desk ...
Towards a Framework for Collating Help-desk Responses from Multiple Documents
Responses to help-desk email inquiries are often repetitive, sharing varying degrees of commonality. In addition, a significant proportion of the responses are generic, containing a very low level of technical content. In this paper, we present a corpus-based approach for identifying common elements in help-desk responses and using them to construct a new response. A help-desk domain is unique ...
Journal title:
- IEEE Intelligent Systems
Volume 15
Publication date: 2000